Syntactic Recovery and Spelling Correction of Ill-formed Sentences
نویسندگان
چکیده
This paper describes syntactic repair and spelling correction of ill-formed sentences within a context-free grammar using non-static filtering, of ill-formed sentences which violate subjectverb agreement or premodifier-noun agreement. The system described here provides recovery of local trees, reconstruction of the sentence, and spelling correction of detected typographical errors. It also produces a report on the repair that has been carried out. The system includes generalised problem-solving strategies for detecting and correcting several types of syntactic and typographical errors, and also includes heuristics. The paper focuses on a system for the integration of lexical and syntactic recovery of ill-formed sentences at the local tree and the final goal (sentence) level. The system is based on a chart parser and employs a mixed top-down/bottom-up strategy together with left-to-right and rightto-left parsing. The implementation is composed of two chart parsers: a well-formed sentence chart parser (WFSCP) and an ill-formed sentence chart parser (IFSCP), and a spelling correction algorithm based on dictionary lookup.
منابع مشابه
Integrated Correction of Ill-Formed Sentences
This paper describes a system that performs hierarchical error recovery, and detects and corrects a single error in a sentence at the lexical, syntactic, and/or semantic levels. If the system is unable to repair an erroneous sentence on the assumption that it has a single error, a multiple error recovery system is invoked. The system employs a chart parsing algorithm and uses an augmented conte...
متن کاملRobust Parsing Based on Discourse Information: Completing Partial Parses of Ill-Formed Sentences on the Basis of Discourse Information
In a consistent text, many words and phrases are repeatedly used in more than one sentence. When an identical phrase (a set of consecutive words) is repeated in different sentences, the constituent words of those sentences tend to be associated in identical modification patterns with identical parts of speech and identical modifiee-modifier relationships. Thus, when a syntactic parser cannot pa...
متن کاملTwo modes of assessment: the case of academicians' writing
This study attempted to investigate writing problems and the relationship between expert-assessment and self-assessment of writing problems. Participants were thirty four non-English faculty members of Tehran and Guilan universities. The instruments were writing an essay on the topic "What teaching strategies do you use in your classes?" in twenty five lines and filling the questionnaire of wri...
متن کاملInterpreting Syntactically Ill-Formed Sentences
The paper discusses three different kinds of syntactic ill-formedness: ellipsis, conjunctions, and actual syntactic errors. It is shown how a new grammatical formalism, based on a two-level repr_e sentation of the syntactic knowledge is used to cope with Ill-formed sentences. The basic control struc ture of the parser is briefly sketched; the paper shows that it can be applied without any subst...
متن کاملJoint English Spelling Error Correction and POS Tagging for Language Learners Writing
We propose an approach to correcting spelling errors and assigning part-of-speech (POS) tags simultaneously for sentences written by learners of English as a second language (ESL). In ESL writing, there are several types of errors such as preposition, determiner, verb, noun, and spelling errors. Spelling errors often interfere with POS tagging and syntactic parsing, which makes other error dete...
متن کامل